


1647 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/005,318C



DATE: 02/06/2001 
TIME: 15:57:09 



Input Set : A:\401c1.app
Output Set: N:\CRF3\02062001\I005318C.raw



SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

C--> 6 (i) APPLICANT: Hein, Mich B.

7 Hiatt, Andrew C.

8 Fitchen, John H.

10 (ii) TITLE OF INVENTION: NOVEL EPITHELIAL TISSUE TARGETING AGENT

12 (iii) NUMBER OF SEQUENCES: 140

14 (iv) CORRESPONDENCE ADDRESS:

15 (A) ADDRESSEE: SEED IP LAW GROUP PLLC

16 (B) STREET: 701 Fifth Avenue, Suite 6300

17 (C) CITY: Seattle

18 (D) STATE: Washington

19 (E) COUNTRY: USA

20 (F) ZIP: 98104

22 (v) COMPUTER READABLE FORM:

23 (A) MEDIUM TYPE: Floppy disk

24 (B) COMPUTER: IBM PC compatible

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

26 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

28 (vi) CURRENT APPLICATION DATA:

C--> 29 (A) APPLICATION NUMBER: US/09/005,318C

C--> 30 (B) FILING DATE: 09-Jan-1998

31 (C) CLASSIFICATION:

33 (viii) ATTORNEY/AGENT INFORMATION:

34 (A) NAME: Christiansen, William T.

35 (B) REGISTRATION NUMBER: 44,614

36 (C) REFERENCE/DOCKET NUMBER: 310098.401C1

38 (ix) TELECOMMUNICATION INFORMATION:

39 (A) TELEPHONE: (206) 622-4900

40 (B) TELEFAX: (206) 682-6031

43 (2) INFORMATION FOR SEQ ID NO: 1:

45 (i) SEQUENCE CHARACTERISTICS:

46 (A) LENGTH: 137 amino acids

47 (B) TYPE: amino acid

48 (C) STRANDEDNESS:

49 (D) TOPOLOGY: linear

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

57 Gln Glu Asp Glu Arg Ile Val Leu Val Asp Asn Lys Cys Lys Cys Ala
58 1               5                   10                  15

60 Arg Ile Thr Ser Arg Ile Ile Arg Ser Ser Glu Asp Pro Asn Glu Asp
61             20                  25                  30

63 Ile Val Glu Arg Asn Ile Arg Ile Ile Val Pro Leu Asn Asn Arg Glu
64         35                  40                  45

66 Asn Ile Ser Asp Pro Thr Ser Pro Leu Arg Thr Arg Pro Val Tyr His
67     50                  55                  60

69 Leu Ser Asp Leu Cys Lys Lys Cys Asp Pro Thr Glu Val Glu Leu Asp
70 65                  70                  75                  80



ENTERED 

RECEIVED 
FEB 20 2001

72 Asn Gln Ile Val Thr Ala Thr Gln Ser Asn Ile Cys Asp Glu Asp Ser
73                 85                  90                  95

75 Ala Thr Glu Thr Cys Tyr Thr Tyr Asp Arg Asn Lys Cys Tyr Thr Ala
76             100                 105                 110

78 Val Val Pro Leu Val Tyr Gly Gly Glu Thr Lys Met Val Glu Thr Ala
79         115                 120                 125

81 Leu Thr Pro Asp Ala Cys Tyr Pro Asp
82     130                 135

84 (2) INFORMATION FOR SEQ ID NO: 2:

86 (i) SEQUENCE CHARACTERISTICS:
87 (A) LENGTH: 135 amino acids
88 (B) TYPE: amino acid
89 (C) STRANDEDNESS:
90 (D) TOPOLOGY: linear

96 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

98 Gln Asp Glu Asn Glu Arg Ile Val Val Asp Asn Lys Cys Lys Cys Ala
99 1               5                   10                  15

101 Arg Ile Thr Ser Arg Ile Ile Pro Ser Ala Glu Asp Pro Ser Gln Asp
102             20                  25                  30

104 Ile Val Glu Arg Asn Val Arg Ile Ile Val Pro Leu Asn Ser Arg Glu
105         35                  40                  45

107 Asn Ile Ser Asp Pro Thr Ser Pro Met Arg Thr Lys Pro Val Tyr His
108     50                  55                  60

110 Leu Ser Asp Leu Cys Lys Lys Cys Asp Thr Thr Glu Val Glu Leu Glu
111 65                  70                  75                  80

113 Asp Gln Val Val Thr Ala Ser Gln Ser Asn Ile Cys Asp Ser Asp Ala
114                 85                  90                  95

116 Glu Thr Cys Tyr Thr Tyr Asp Arg Asn Lys Cys Tyr Thr Asn Arg Val
117             100                 105                 110

119 Lys Leu Ser Tyr Arg Gly Gln Thr Lys Met Val Glu Thr Ala Leu Thr
120         115                 120                 125

122 Pro Asp Ser Cys Tyr Pro Asp
123     130                 135

126 (2) INFORMATION FOR SEQ ID NO: 3:

128 (i) SEQUENCE CHARACTERISTICS:
129 (A) LENGTH: 137 amino acids
130 (B) TYPE: amino acid
131 (C) STRANDEDNESS:
132 (D) TOPOLOGY: linear

138 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

140 Asp Asp Glu Ala Thr Ile Leu Ala Asp Asn Lys Cys Met Cys Thr Arg
141 1               5                   10                  15

143 Val Thr Ser Arg Ile Ile Pro Ser Thr Glu Asp Pro Asn Glu Asp Ile
144             20                  25                  30

146 Val Glu Arg Asn Ile Arg Ile Val Val Pro Leu Asn Asn Arg Glu Asn
147         35                  40                  45

149 Ile Ser Asp Pro Thr Ser Pro Leu Arg Arg Asn Pro Val Tyr His Leu
150     50                  55                  60

152 Ser Asp Val Cys Lys Lys Cys Asp Pro Val Glu Val Glu Leu Glu Asp
153 65                  70                  75                  80

155 Gln Val Val Thr Ala Thr Gln Ser Asn Ile Cys Asn Glu Asp Asp Gly
156                 85                  90                  95

158 Val Pro Glu Thr Cys Tyr Met Tyr Asp Arg Asn Lys Cys Tyr Thr Thr
159             100                 105                 110

161 Met Val Pro Leu Arg Tyr His Gly Glu Thr Lys Met Val Gln Ala Ala
162         115                 120                 125

164 Leu Thr Pro Asp Ser Cys Tyr Pro Asp
165     130                 135

167 (2) INFORMATION FOR SEQ ID NO: 4:

169 (i) SEQUENCE CHARACTERISTICS:
170 (A) LENGTH: 136 amino acids
171 (B) TYPE: amino acid
172 (C) STRANDEDNESS:
173 (D) TOPOLOGY: linear

179 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

181 Glu Asp Glu Ser Thr Val Leu Val Asp Asn Lys Cys Gln Cys Val Arg
182 1               5                   10                  15

184 Ile Thr Ser Arg Ile Ile Arg Asp Pro Asp Asn Pro Ser Glu Asp Ile
185             20                  25                  30

187 Val Glu Arg Asn Ile Arg Ile Ile Val Pro Leu Asn Thr Arg Glu Asn
188         35                  40                  45

190 Ile Ser Asp Pro Thr Ser Pro Leu Arg Thr Glu Pro Lys Tyr Asn Leu
191     50                  55                  60

193 Ala Asn Leu Cys Lys Lys Cys Asp Pro Thr Glu Ile Glu Leu Asp Asn
194 65                  70                  75                  80

196 Gln Val Phe Thr Ala Ser Gln Ser Asn Ile Cys Pro Asp Asp Asp Tyr
197                 85                  90                  95

199 Ser Glu Thr Cys Tyr Thr Tyr Asp Arg Asn Lys Cys Tyr Thr Thr Leu
200             100                 105                 110

202 Val Pro Ile Thr His Arg Gly Val Thr Arg Met Val Lys Ala Thr Leu
203         115                 120                 125

205 Thr Pro Asp Ser Cys Tyr Pro Asp
206     130                 135

208 (2) INFORMATION FOR SEQ ID NO: 5:

210 (i) SEQUENCE CHARACTERISTICS:
211 (A) LENGTH: 119 amino acids
212 (B) TYPE: amino acid
213 (C) STRANDEDNESS:
214 (D) TOPOLOGY: linear

220 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

222 Glu Gln Glu Tyr Ile Leu Ala Asn Asn Lys Cys Lys Cys Val Lys Ile
223 1               5                   10                  15

225 Ser Ser Arg Phe Val Pro Ser Thr Glu Arg Pro Gly Glu Glu Ile Leu
226             20                  25                  30

W--> 228 Glu Arg Asn Ile Gln Ile Thr Ile Pro Thr Ser Ser Arg Met Xaa Ile
229              35                  40                  45

231 Ser Asp Pro Tyr Ser Pro Leu Arg Thr Gln Pro Val Tyr Asn Leu Trp
232     50                  55                  60



234 Asp Ile Cys Glu Lys Cys Asp Pro Val Gln Leu Glu Ile Gly Gly Ile
235 65                  70                  75                  80

237 Pro Val Leu Ala Ser Gln Pro Xaa Xaa Ser Xaa Pro Asp Asp Glu Cys
238                 85                  90                  95

240 Tyr Thr Thr Glu Val Asn Phe Lys Lys Lys Val Pro Leu Thr Pro Asp
241             100                 105                 110

243 Ser Cys Tyr Glu Tyr Ser Glu
244         115

247 (2) INFORMATION FOR SEQ ID NO: 6: 

249 (i) SEQUENCE CHARACTERISTICS:
250 (A) LENGTH: 128 amino acids
251 (B) TYPE: amino acid
252 (C) STRANDEDNESS:
253 (D) TOPOLOGY: linear

259 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

261 Asn Lys Cys Met Cys Thr Arg Val Thr Ala Arg Ile Arg Gly Thr Arg
262 1               5                   10                  15

264 Glu Asp Pro Asn Glu Asp Ile Val Glu Arg Tyr Ile Arg Ile Asn Val
265             20                  25                  30

267 Pro Leu Lys Asn Arg Gly Asn Ile Ser Asp Pro Thr Ser Pro Leu Arg
268         35                  40                  45

270 Asn Gln Pro Val Tyr His Leu Ser Pro Ser Cys Lys Lys Cys Asp Pro
271     50                  55                  60

273 Tyr Glu Asp Gly Val Val Thr Ala Thr Glu Thr Asn Ile Cys Tyr Pro
274 65                  70                  75                  80

276 Asp Gln Gly Val Pro Gln Ser Cys Arg Asp Tyr Cys Pro Glu Leu Asp
277                 85                  90                  95

279 Arg Asn Lys Cys Tyr Thr Val Leu Val Pro Pro Gly Tyr Thr Gly Glu
280             100                 105                 110

282 Thr Lys Met Val Gln Asn Ala Leu Thr Pro Asp Ala Cys Tyr Pro Asp
283         115                 120                 125

286 (2) INFORMATION FOR SEQ ID NO: 7: 

288 (i) SEQUENCE CHARACTERISTICS:
289 (A) LENGTH: 421 base pairs
290 (B) TYPE: nucleic acid
291 (C) STRANDEDNESS: single
292 (D) TOPOLOGY: linear

296 (ix) FEATURE:
297 (A) NAME/KEY: CDS
298 (B) LOCATION: 1..414

301 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

303 GAT CAG GAA GAT GAA CGT ATT GTT CTG GTT GAC AAC AAG TGC AAG TGT  48
304 Asp Gln Glu Asp Glu Arg Ile Val Leu Val Asp Asn Lys Cys Lys Cys
305 1               5                   10                  15

307 GCT CGT ATT ACT TCT AGA ATC ATC CGT AGC TCA GAG GAC CCA AAT GAA  96
308 Ala Arg Ile Thr Ser Arg Ile Ile Arg Ser Ser Glu Asp Pro Asn Glu
309             20                  25                  30

311 GAT ATA GTC GAA CGT AAC ATC CGT ATC ATC GTC CCA CTG AAT AAC CGG  144
312 Asp Ile Val Glu Arg Asn Ile Arg Ile Ile Val Pro Leu Asn Asn Arg
313         35                  40                  45

315 GAG AAT ATC TCA GAT CCT ACA AGT CCG TTG CGC ACA CGC TTC GTA TAC  192
316 Glu Asn Ile Ser Asp Pro Thr Ser Pro Leu Arg Thr Arg Phe Val Tyr
317     50                  55                  60

319 CAC CTG TCA GAT CTG TGT AAG AAG TGT GAT CCA ACA GAG GTA GAG CTG  240
320 His Leu Ser Asp Leu Cys Lys Lys Cys Asp Pro Thr Glu Val Glu Leu
321 65                  70                  75                  80

323 GAG AAT CAG ATA GTC ACT GCG ACT CAA AGC AAC ATT TGC GAT GAG GAC  288
324 Asp Asn Gln Ile Val Thr Ala Thr Gln Ser Asn Ile Cys Asp Glu Asp
325                 85                  90                  95

327 AGC GCT ACA GAA ACC TGC AGC ACC TAC GAT AGG AAC AAA TGC TAC ACG  336
328 Ser Ala Thr Glu Thr Cys Ser Thr Tyr Asp Arg Asn Lys Cys Tyr Thr
329             100                 105                 110

331 GCC GTG GTT CCG CTC GTG TAT GGT GGA GAG ACA AAA ATG GTG GAA ACT  384
332 Ala Val Val Pro Leu Val Tyr Gly Gly Glu Thr Lys Met Val Glu Thr
333         115                 120                 125

335 GCC CTT ACG CCC GAT GCA TGC TAT CCG GAC TGAATTC  421
336 Ala Leu Thr Pro Asp Ala Cys Tyr Pro Asp
337     130                 135

340 (2) INFORMATION FOR SEQ ID NO: 8:

342 (i) SEQUENCE CHARACTERISTICS:
343 (A) LENGTH: 215 base pairs
344 (B) TYPE: nucleic acid
345 (C) STRANDEDNESS: single
346 (D) TOPOLOGY: linear

350 (ix) FEATURE:
351 (A) NAME/KEY: CDS
352 (B) LOCATION: 1..213

355 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

357 GAT CAG AAG TGC AAG TGT GCT CGT ATT ACT TCT AGA ATC ATC CGT AGC  48
358 Asp Gln Lys Cys Lys Cys Ala Arg Ile Thr Ser Arg Ile Ile Arg Ser
359 1               5                   10                  15

361 TCA GAG GAC CCA AAT GAA GAT ATA GTC GAA CGT AAC ATC CGT ATC ATC  96
362 Ser Glu Asp Pro Asn Glu Asp Ile Val Glu Arg Asn Ile Arg Ile Ile
363             20                  25                  30

365 GTC CCA CTG AAT AAC CGG GAG AAT ATC TCA GAT CCT ACA AGT CCG TTG  144
366 Val Pro Leu Asn Asn Arg Glu Asn Ile Ser Asp Pro Thr Ser Pro Leu
367         35                  40                  45

369 CGC ACA CGC TTC GTA TAC CAC CTG TCA GAT CTG TGT AAG AAG GAT GAG  192
370 Arg Thr Arg Phe Val Tyr His Leu Ser Asp Leu Cys Lys Lys Asp Glu
371     50                  55                  60

373 GAC AGC GCT ACA GAA ACC TGC TG  215
374 Asp Ser Ala Thr Glu Thr Cys
375 65                  70

378 (2) INFORMATION FOR SEQ ID NO: 9:

380 (i) SEQUENCE CHARACTERISTICS:

381 (A) LENGTH: 140 base pairs 

382 (B) TYPE: nucleic acid 

383 (C) STRANDEDNESS: single

L:6 M:220 C: Keyword misspelled or invalid format, [(i) APPLICANT:]

L:29 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:]

L:30 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]

L:228 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5

L:237 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5

L:2470 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:118

L:2473 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:118

L:2624 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123

L:2630 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123

L:2633 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123